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ABSTRACT 

We present the results of observations of 1882 sources in the Bolocam Galactic Plane Survey 
(BGPS) at 1.1 mm with the 10m Heinrich Hertz Telescope simultaneously in HCO+ J = 3 — 2 and 
N2H+ J = 3 - 2. We detect 77% of these sources in HCO+ and 51% in N2H+ at greater than 3c7. 
We find a strong correlation between the integrated intensity of both dense gas tracers and the 
1.1 mm dust emission of BGPS sources. We determine kinematic distances for 529 sources (440 
in the first quadrant breaking the distance ambiguity and 89 in the second quadrant) We derive 
the size, mass, and average density for this subset of clumps. The median size of BGPS clumps is 
0.75 pc with a median mass of 330 (assuming T^^gt = 20 K). The median HCO+ linewidth is 
2.9 km s"""^ indicating that BGPS clumps are dominated by supersonic turbulence or unresolved 
kinematic motions. We find no evidence for a size-linewidth relationship for BGPS clumps. We 
analyze the effects of the assumed dust temperature on the derived clump properties with a Monte 
Carlo simulation and we find that changing the temperature distribution will change the median 
source properties (mass, volume-averaged number density, surface density) by factors of a few. 
The observed differential mass distribution has a power-law slope that is intermediate between 
that observed for diffuse CO clouds and the stellar IMF. BGPS clumps represent a wide range of 
objects (from dense cores to more diffuse clumps) and are typically characterized by larger sizes 
and lower densities than previously published surveys of high-mass star-forming regions. This 
collection of objects is a less- biased sample of star- forming regions in the Milky Way that likely 
span a wide range of evolutionary states. 

Subject headings: Stars: formation - Galaxy: structure, kinematics and dynamics - ISM: clouds 
- Submillimeter: ISM - Surveys 



1. Introduction 

Stars form out of clouds of dense molecular gas and dust. From detailed studies of nearby molecular 
clouds, we have developed a picture of how stars with masses typically close to our Sun's form and evolve 
onto the main sequence (e.g., Shu, Adams, & Lizano 1987). A corresponding picture does not exist for the 
highest mass end of the stellar-mass spectrum. This is in part due to high-mass star-forming regions in 
our Galaxy being at greater distances, and thus being observed at lower spatial resolution, than low-mass 
regions. High-mass stars also form in highly clustered environments, whereas the well-studied nearby low- 
mass stars are typically more isolated (e.g. Taurus). Not even the basic formation mechanism of massive-star 
formation (scaled-up version of monolithic core collapse vs. competitive accretion formation; Shu et al. 1987; 
Bonnell & Bate 2006; McKee & Ostriker 2007) is yet well agreed upon, especially for the formation of the 
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highest mass stars. It is possible that both processes are important in different regimes of the stellar mass 
spectrum. For low-mass stars, there are observational indicators of the evolutionary state of the protostar 
(e.g. Thoi - Temperature of a blackbody with a peak at the flux weighted mean frequency of the spectral 
energy distribution and ajji - IR spectral index, defining the Class 0, I, II, III system; Lada 1987, Evans 
et al. 2009); a universal evolutionary sequence for high-mass stars is still being developed and that exact 
ordering of the possible observational indicators (e.g., the presence of a II2O maser or a CII3OII Class I or 
Class II maser, e.g. Plume et al. 1997; Shirley et al. 2003; De Buizer et al. 2005; Minier et al. 2005; Elhngsen 
et al. 2007; Longmore et al. 2007; Purcell et al. 2009; Breen et al. 2010) is still debated. 

One observational aspect that has limited our complete understanding of star formation is that we lack 
a complete census of the star-forming regions in our own Galaxy and, therefore, a census of their basic 
properties (size, mass, luminosity). Previous surveys of star-forming regions have been heavily biased. For 
instance, the earliest studies of dense molecular gas focused on known (optical or radio) H II regions where 
an O or B spectral type star had already formed. The discovery and cataloguing of UCH II regions (e.g. 
Wood & Churchwell 1989) extended studies to an earlier embedded phase, but still required the presence 
of a forming high-mass star. Infrared Dark Clouds (IRDCs), clouds of dust and gas that are opaque at 
mid-infrared wavelengths (i.e. 8 /im), permitted less-biased studies of star forming regions through the 
earliest (prestellar) phases and across the stellar mass spectrum (Carey et al. 2009; Peretto & Fuller 2009); 
however, they were limited to clouds at near kinematic distances and typically observable only in the inner 
Galaxy (—60 < ^ < 60 degrees). Dust continuum observations at far-infrared through millimeter wavelengths 
provide the least-biased means to survey star forming regions at all embedded evolutionary phases and a 
wide range of the stellar mass spectrum across the Milky Way Galaxy since the emission is optically thin, 
always present, and can trace small amounts of mass. 

In the past decade, new surveys of the Milky Way Galaxy have been made from mid-infrared through 
millimeter wavelengths. Several Galactic plane surveys are published or currently being observed, including 
the Bolocam Galactic Plane Survey (BGPS: Aguirre et al. 2011), the APEX Telescope Large Area Survey of 
the Galaxy (ATLASGAL: SchuUer et al. 2009), and the Herschel infrared Galactic Plane Survey (Hi-GAL: 
Molinari et al. 2010). The goals of these surveys are to look for the precursors to massive star formation 
in the Galaxy as a whole, without targeting individual regions known to contain forming stars. They are 
an integral part of completing a full census of star-forming cores and clumps in the Milky Way as they are 
sensitive to star formation at all stages. 

The Bolocam Galactic Plane Survey is a 1.1 mm continuum survey of the Galactic plane (Aguirre et al. 
2011). Covering 220 square degrees at 33" resolution, the BGPS is one of the first large-area, systematic 
continuum surveys of the Galactic plane in the millimeter regime. The BGPS spans the entire first quadrant 
of the Galaxy with a latitude range of |5| < 0.5 degrees from the Galactic plane and portions of the second 
quadrant (Aguirre et al. 2011). The survey has detected and catalogued approximately 8400 clumps of dusty 
interstellar material (Rosolowsky et al. 2010). The BGPS is beam matched to the spectroscopic data we are 
taking in this paper. This allows us to easily compare the gas and dust in the same phase of star formation. 
The BGPS data are available in full from the IPAC websit^El 

The vast majority of sources detected in the BGPS represents a new population of dense, potentially star- 
forming clumps in the Milky Way. The basic properties of these objects such as size, mass, and luminosity 
depend on the distance to the objects. However, since the BGPS observations are continuum observations. 
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they contain no kinematic information. In this paper, we use the hne-of-sight velocity (v^S-r) from a 
molecular line detection and a model of the Galaxy to determine a kinematic distance. Not only is the vlsr 
useful, but the line properties themselves can elucidate a number of properties of the dense gas in the clumps 
(e.g., virial mass, infalling gas, outflows, etc.). 

Most kinematic surveys of the Milky Way have been performed using low gas density tracers (e.g. H I: 
Giovanelh et al. 2005; ^^CO: Dame et al. 2001, GRS("CO): Jackson et al. 2006). With these low density 
tracers, most lines-of-sight in the Galaxy have multiple velocity components. To mitigate this, we choose 
dense gas tracers that will be excited exclusively in the BGPS clumps. Surveying dense gas has been done 
before using CS J = 2 — 1 toward IRDGs (see Jackson et al. 2008). In this survey, we simultaneously 
observe two dense gas tracers HCO"*" J = 3 — 2 and N2H+ J = 3 — 2 using the 1 mm ALMA prototype 
receiver on the Heinrich Hertz Submillimeter Telescope (HHT). The HHT resolution of ~ 30" at 1.1 mm is 
nearly beam-matched to the original BGPS survey, allowing a one-to-one comparison between these dense 
gas tracers and peak 1.1 mm continuum emission positions. These two molecular tracers have very similar 
effective excitation densities, Ueff ~ 10** cm~'^, that are well-matched to the average density derived from 
the continuum-emitting dust (see Dunham et al. 2010). The effective excitation density for a molecular 
tracer is defined in Evans (1999) as the density at a given kinetic temperature required to excite a 1 K line 
for a column density of logio N / Av = 13.5. To determine the effective density, we use RADEX, which is a 
Monte Carlo radiative transfer code (Van der Tak et al. 2007), assuming logio N = 13.5 cm~^ and Av = 1 
km s~^. 

The chemistry of HCO+ and N2H+ is useful, as these two molecules have opposite chemistries with respect 
to the CO molecule (J0rgcnsen et al. 2004). IICO+ is created by CO and N2II+ is destroyed by CO. The 
formation routes of HCO+ and N2H+ are dominated by the following reactions: 



For N2H+ to exist in large quantities, the gas must be cold where CO has frozen out onto dust grains. 
The ratio of N2II+/HCO"'" emission can be a chemical indicator of the amoimt of dense, cold gas in BGPS 
clumps. Even with the modest upper energy levels, E„/k (HCO+ J = 3 — 2) = 25.67 K and E„/k (N2II+ 
J = 3 — 2) = 26.81 K, these transitions can still be excited at low temperatures if the density of the gas is 
high enough. This brings up an interesting conflict for N2H+: chemically, it favors a low kinetic temperature 
where CO is depleted but higher T^m or higher gas density leads to a stronger excitation of the J = 3 — 2 
line. 

In this paper, we present the results for spectroscopic observations of 1882 BGPS clumps. In § 2 we 
discuss source selection and observing, calibration, and reduction procedures. In § 3 detection statistics, 
line intensities, velocities and linewidths are analyzed. In § 4 we calculate the kinematic distance to each 
detected source and determine our size-linewidth relations, clump mass spectra, and present a face-on view 
of the Milky Way Galaxy based on kinematic distances determined from our sample. 



H+ + C0^ HCO+ + H2 



(1) 



H+ + N2^ N2H+ + H2 
N2H+ + CO^N2 + HCO+ 



(2) 
(3) 



- 5 - 



2. Observations, Calibrations, and Reduction 

2.1. Facility and Setup 

Observations were conducted with the Heinrich Hertz SubmiUimeter Telescope on Mount Graham, Arizona. 
The data were taken over the course of 44 nights beginning in February 2009 and ending in June 2009. We 
utilized the ALMA Band-6 dual-polarization sideband-separating prototype receiver in a 4-IF setup (Lauria 
et al. 2006, ALMA memo #553). With this setup we simultaneously and separately observe both the upper 
and lower sidebands (USB and LSB, respectively) in horizontal polarization (Rpoi) and vertical polarization 
(Vpoi) using two different linearly polarized feeds on the receiver. The receiver was tuned to place the HCO+ 
J = 3 - 2 (267.5576259 GHz) Hue in the center of the LSB. The IF was set to 6 GHz, which offsets the 
N2H+ J = 3 - 2 hne (279.5118379 GHz) in the USB by +47A7 km s"^ The signals were recorded by 
the 1 GHz Filterbanks (1 MHz per channel, 512 MHz bandwidth in 4-IF mode; LSB velocity resolution 
Avch = 1-12 km s^^ and USB velocity resolution Avch — 1-07 km s^^) in each polarization and sideband 
pair {Ypoi LSB, Ypoi USB, Rpoi LSB, Rpoi USB). 



2.2. Calibration and Sideband Rejections 

Every observing session utilized 3 types of observations to calibrate the velocity offset, temperature scale, 
and rejection of the sideband separating receiver. The antenna temperature scale T^ is used at the HHT and 
is set by the chopper wheel calibration method (Penzias & Burrus 1973). This temperature scale was then 
converted to T,„f, = T\/rj„ib by observing Jupiter and calculating the main beam efficiency {rjmb] Mangum 
1993) 

frej ■ TXi Jupiter) 



J{Vs,Tjupiter) — J(Vs-,TcMb) 



I 1 1-1 "eqOpol 

exp [ — /?! 2 • 

mb 



(4) 



where Jiv^Ts) ~ cxp(feiy'/fcT'£;)-i Planck function in temperature units evaluated at the observed frequency 
V and brightness temperature T^, T^ is the average observed temperature of Jupiter in the bandpass, 
Jupiter — 170 K ±5 K, TcMB = 2.73 K, 9f,q & Opoi are the daily equatorial and poloidal angular diameters 
of Jupiter, O^b is the HHT FWHM (equal to 28.2" LSB and 26.9" USB). The sideband rejection correction 
factor is given by, 

" I{TXLSB) ) ■ 

We calculate frej, by measuring the integrated intensity of the flux that bleeds over from the LSB into the 
USB by observing S140, a source with very strong HCO+ J = 3 — 2 emission {Tmb — 18 K). The average 
rejection was —13.8 dB in Ypoi and —15.2 dB in Upoi- We ignore the difference in atmospheric opacities 
between 267 GHz and 279 GHz since it is small. 



We report the computed rjmb for each observing session in Table [T] (see Figure [T]). Each data point for rimb 
in Figure [T] consists of the average of five or more observations of Jupiter each night. There are no apparent 
trends in rjmb versus time except on MJD 918 - 920, when r],nb for Ypoi is significantly lower than for the rest 
of the observing sessions. For these dates we choose to use the average r/mb and treat them independently 
from the rest of the calibration. A drop in the integrated intensity, I(T^), is also seen in the data taken of 
the two spectral line calibration sources, S140 and W75(OH) (Figure [T]), supporting the decision to treat 
these days separately. 

We also compared r/mb for the two polarizations and sidebands against each other (Figure [2]) . The two 
upper panels compare the USB and LSB of each polarizations against each other. These are highly correlated 
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with Spearman's rank coefficients of p ^ 1, which is expected, as each linear polarization has its own feed. 
The Spearman's rank coefficient is a measure of the monotonic dependence between two variables. The 
lower panels compare the two polarization feeds of the LSB and two polarization feeds of the USB against 
each other. These are less well-correlated with Spearman's rank coefficients of p ^ 0.5 and show that the 
variation we see in r]mb does not come from a systematic effect that affects both polarizations at the same 
time. 

We observed each source in the catalog described in § |2.3| for 2 minutes total integration time. We position 
switched between a common OFF position for each 0.5 degrees in Galactic longitude {£). Each of these OFF 
positions was observed for 6 minutes to check if the OFF position had any detectable line emission. For a 
subset of sources near the end of our observations, starting on MJD — 245400 = 986, the integration time 
was increased to 4 or f minutes to compensate for deteriorating weather. Our goal was to keep the baseline 
rms less than fOO mK {Avch ^ 1 kni s^^) for as many sources as possible. S140 was also used to calculate 
the Allan Variance of the ALMA prototype receiver (Schieder & Kramer 2001). Accounting for the Allan 
Variance, we determined that ~ 20 seconds was the optimal switching time between ON and OFF positions 
for position switching with the 1 mm ALMA prototype receiver. 

2.3. Source Selection 

We selected sources out of a preliminary version of the BGPS source catalog (BOLOCAT, Rosolowsky et 
al. 2010). We used the BOLOCAT version 0.7 to divide sources into logarithmic flux bins with equal numbers 
using the 40" aperture flux. We point at peak 1.1 mm continuum positions as listed in the BOLOCAT and 
we restricted the range of Galactic longitudes to fall between 10° < i < 100°. The entire BOLOCAT in this 
longitude range was divided into logarithmically-spaced flux bins from Si.imm — 0.1 Jy to Si imm ~ 0-4 Jy 
in intervals of logio(Si.imm) = 0.1. All sources greater than Si.imm ~ 0.4 Jy were included for observation 
{N = 689). Below this flux, 100 sources per bin were selected at random to comprise a flux-selected set of 
N = 1,289 sources from the BOLOCAT vO.7. In addition, we observed all BOLOCAT vO.7 sources in £ 
ranges of 10° - 11.5°, 15° - 21°, and 80° - 85.5°. These ranges were chosen due to a combination of overlap 
with other surveys and observing availability. Sources with £ > 100° were taken from the BOLOCAT vl.O 
and were only restricted by observability. 

Near the end of our spectroscopic survey, the BGPS version 1.0 maps and the BOLOCAT vl.O were 
released. We recomputed the photometry of our sources at the observed vO.7 positions on the vl.O maps 
using the HHT beamsize, this is the data presented in § 3. A correction factor of 1.5 was multiplied to all of 
the 1.1 mm fluxes (see Aguirre et al. 2011). This calibration factor was determined by the BGPS team to 
account, in part, for the spatial filtering present in the BGPS vl.O maps and possible calibration differences 
between the BGPS and other surveys. This factor brings the BGPS fluxes inline with those of other surveys 
(e.g. Motte et al. 2007). 

Between versions of the catalogs the algorithms for processing the images were improved and thus source 
peak continuum positions may have moved or sources may even be removed from the catalog. In § 4 we 
compare physical properties of the sources and need to know the overall source properties not just the 
photometry of the locations we observed. We take the nearest source from the vl.O catalog to the position 
we observed. The median offset between observed vO.7 positions and vl.O positions is 6.4". The vast majority 
(83% ) of vO.7 observed positions lie within 15" (1/2 the beam width) of the nearest vl.O position. Since the 
median angular size of our observed sources is ~ 60", the small positional differences between vO.7 and vl.O 
do not significantly affect the physical properties derived from the 1.1 mm emission (e.g., size, mass). No 
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source for which we have resolved the distance ambiguity and derived physical properties for in §4 (Known 
Distance Sample) has an offset between the catalogs greater than 30" (one beam width) when determining 
physical properties. 

In the following analysis, we refer to the "Full Sample" of 1882 observed sources and the "Deep Sample" of 
707 sources where the entire BOLOCAT was observed within the longitude ranges described above. Figure 
|4] shows the location of the sources we observed in the Full Sample and the sources in the Deep Sample. 

2.4. Data Reduction 

The spectra were reduced using scripts we developed for the CLASS software packag^ In 4-IF mode, 
there are four filterbank spectra for each source. The HCO+ J = 3 — 2 spectrum was used to determine the 
baseline window, typically ±50 - 75 km from the line center, and to determine the line window, typically 
±10-15 km s~^ from line center. The two polarizations of HCO"'" J = 3 — 2 data were then baselined, and 
averaged together. This averaged spectrum is used to determine the line flag of the observed source and the 
line flag is determined from the approximate line shape (see Table [2]). A flag of means there is no apparent 
line in the spectrum at any velocity. A flag of 1 indicates a single line in the spectrum and the line exhibits 
no apparent structure from a line wing or self-absorbed profile. A flag of 2 means there was confusion along 
the line of sight and multiple velocity components are observed. In this case there is no way of determining 
which source is associated with the 1.1 mm map without mapping the molecular emission. A flag of 4 means 
there was a possible line wing. A Gaussian fit is plotted over the data to emphasize any deviations from 
a Gaussian shape; this helped to accentuate any sources with line profiles with a red or blue line wing, or 
possibly both. A flag of 5 means the line profile was possibly self-absorbed. Examples of spectra for the 
various flags are shown in Figure [5] 

The N2II+ J = 3 — 2 line is offset +47.47 km s~^ from the center of the USB; therefore, each of the 
HCO"'" baseline windows is shifted by that offset in order to baseline the N2H"'" data. The N2H"'" spectra are 
baselined and averaged in the same manner as above and the line is flagged for its quality and structure. 

After each spectrum is flagged, it is converted to the T^b scale and the two polarizations are averaged 
together. Each spectrum is corrected with the corresponding ry„i{, given by the date it was observed and 
its polarization, as explained in §2.2[ The spectra are weighted by their baseline rms values, averaged, and 
baselined. The resulting combined spectra are used in our analysis. 

2.5. Analysis Pipeline 

Once the spectrum for each source has been calibrated and averaged, the next step is to measure the line 
properties. The analysis of all the spectra is performed in IDL using custom and ASTROLIB routines, and 
all CLASS spectra are exported to an ASCII file containing the final spectrum. The peak temperature is 
given by the maximum temperature within the defined line window and the error is the rms of the data 
outside of that window. The integrated intensity, central velocity, and line width are computed using both 
an analysis of the 0*'', 1**, and 2"'' moments and by fitting a Gaussian model to the spectral line. 



3http://www.iram.fr/IRAMFR/GILDAS/ 
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2.5.1. Moment Analysis 
The moments of a spectral line are calculated using 

M„ = £ T,v^^v,h, (6) 

i=vi 

where n is the moment and i represents each channel between the vi and defining the line window. These 
moments are then used to compute the integrated intensity, central velocity, and FWHM using 

/(T„,b) = Afo (7) 

.cen = ^ (8) 

Moment calculations are sensitive to the rms of the baseline and our generously large line windows. For 
lower signal-to-noise lines, a small noise feature in the spectrum can drastically change the first moment 
when using all of the data points in the line window. To compensate for low signal-to-noise we estimate the 
line center using only data three times the baseline rms in the first moment calculation. This new method 
returns velocities and widths that more closely match those that are determined by eye than the method 
using all of the data within the line window. This only has significant effects for low signal-to-noise spectra. 



2.5.2. Gaussian Fitting 

Another method of determining the central velocity and FWHM for a spectral line is to fit a Gaussian 
model to the line profile. This method has its drawbacks as well, but the main drawback is that it struggles 
with lines that deviate strongly from Gaussian shaped line profiles. Examples of such arc lines with self- 
absorbed profiles or lines with very prominent line wings, (Figure [s] (c) and (d)). The Gaussian fits are 
computed with the MPFITPEAK function (Markwardt 2009) and return a reduced x^- The boundary 
conditions chosen are the following: 1) the baseline is defined to be 0, 2) the peak line temperature of the 
Gaussian is defined to be positive, 3) the central velocity of the line must be within the line window, and 4) 
the FWHM must be smaller than the line window. For the starting parameters of the fit, we use the results 
from the moment analysis. 



At this point we refine our method of computing the desired quantities for the line shape. After the initial 
Gaussian fit is completed, the next step is to modify the line window and recompute both the moment analysis 
and the Gaussian fitting. To modify the line window we used the parameters of the line as determined by the 
Gaussian fit to center the line window on the line center, Ycauss, and extend it by the measured lincwidth, 
±2 • acauss- If either bound of this new line window extends outside of the original, the original bounding 
velocity is used for that term. The fit and moments are recomputed but do not yield significant changes for 
most sources. At this time aj (the error on the integrated intensity) is calculated using the final "fit" line 
window. <7i = (Tt ■ \/ 5vch • {vu — vi) where Vch is the channel width {6vch = 1-1 km/s), Vu and vi are the 
upper and lower bounds of the line window, and ctt is the baseline rms. If the line was undetected, flag of 
0, the line window used to estimate the error is ~ 6 km s~^, within which more than 95% of all measured 
FWHMs lie (Figure [7|. The Gaussian fits are used to determine the HCO+ and N2H+ central velocities 
and the HCO"'" linewidths while the zero*'' moment is used to calculate the integrated intensity. For N2H+ 
linewidths, we use an IDL script that deals with the hyperfine lines (assuming a Gaussian shape for each 
hyperfine line) and uses MPFIT to determine the best-fit line profile. For the rest of the paper, linewidths 
refer to only the Gaussian-fit, observed HCO"'" linewidth. 
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3. Detection Statistics and Analysis 

In this section, we discuss the statistics of the detected sources in HCO"*" and N2H+ and correlations 
between integrated intensity for HCO+ and N2H+ with respect to the two sample groups. We also discuss 
the determined vlsr and the analysis of the line centroids and linewidths. 

3.1. Detection Statistics 

Each source has two flags, one for HCO+ and one for N2H+; multiple flags arc not set for any of the 
sources/tracer pairs; it is either a detection (1,2,4,5) or a non detection (0). A breakdown of the number of 
sources with each flag is shown in Figure [S] Out of a total of 1882 sources observed we detect 1444 (76.7%) 
in HCO+ and 952 (50.5%) in N2H+ at a 3cr or greater level. Out of 1444 HCO+ detections, 1119 (77.49%) 
are single-velocity component detections, 39 (2.70%) are multiple-velocity component detections, 67 (4.64%) 
have possible line wings, and 219 (15.17%) have a possible self-absorption profile. For the 952 N2H+ detec- 
tions, 919 (96.53%) are single-velocity component detections, 14 (1.47%) are multiple-velocity component 
detections, 6 (0.63%) have possible line wings, and 13 (1.37%) have a possible self-absorption profile. Break- 
ing the sources down into the "Deep Sample" where we observed every source in the BOLOCAT vO.7 in 
certain i ranges, we find slightly lower detection rates: 72.6% and 41.2% of the N = 707 sources are detected 
in HCO"'" and N2H+ respectively (See Figure [s] for the breakdown of flagging statistics for the "Deep Sam- 
ple"). Detection statistics versus 1.1 mm dust emission are presented in Figure [9| For sources in the lowest 
flux percentile, we detect barely 40% in HCO+. Sources in the highest flux percentile have a 99% detection 
rate in HCO+ and 88% in N2H+. 

We find that nearly all (99.6%) of N2H+ detections are associated with an HCO'*' detection at the > 3a 
level. Only 4 sources that are detected in N2H+ do not have a 3cr detection in HCO+. These sources 
are approximately 2a detections and show a small amount of HCO"'' emission at the correct velocity to be 
associated with the N2H+ emission. The HCO+ line flag statistics change in a fairly interesting way for the 
sources with detected N2H+. The percentage of sources showing a possible self-absorbed profile increases 
from 15.17% to 21.10%. The percentage of single line detections drops by a similar amount. About 1% of 
sources show possible self-absorption in both HCO+ and in N2H+. For sources that display self absorption 
in N2H+, 11 of 13 also showed self absorption in HCO+. 

A recent mapping survey of these two molecular transitions toward a sample of IRDCs has shown that 
HCO+ and N2H+ J = 3 — 2 emission has a very similar extent and morphology to the 1.1 mm emission 
(Battersby et al. 2010). To completely understand the physical properties of the gas that is excited in these 
transitions, we require a detailed source model and radiative transfer modeling of multiple transitions in 
each species. However, from our astrochemical knowledge of these two species, we can make some general 
statements about the regions where they are excited. HCO"'' probes clumps with a wide range of properties. 
It can exist in warm regions where CO is abundant (e.g., Reiter et al. 2011) and cold regions where CO 
has frozen out onto dust grains (e.g., Gregersen and Evans 2000). It is possible that HCO+ is depleted by 
freeze-out in some of these clumps with cold (T < 20 K) dense {n > 10'* cm""^) gas within the HHT beam 
(see Tafalla et al. (2006) for examples observed toward low mass cores), although our observations indicate 
that this mechanism is unlikely to be dominant in BGPS clumps. HCO^ J = 3 — 2 emission likely originates 
in the dense, warm inner regions of these clumps. In contrast, N2H''' is destroyed by CO in the gas phase, 
and thus N2H+ is most abundant in cold, dense gas where the CO abundance is depleted (J0rgensen et 
al. 2004). Thus, in star- forming clumps with a strong temperature increase toward the center, N2H"'" may 
only be tracing the outer parts of the clumps where the gas is still relatively dense and cold. This chemical 
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differentiation of N2H+ has been mapped in a few high-mass star-forming regions (Pirogov et al. 2007; 
Reiter et al. 2011; Busquet et al. 2010), although the differentiation mostly occurs on size scales that are 
unresolved within our 30" beam. 



In nearly 12% of sources, the HCO+ line profiles display apparent self-absorption. For an optically thick 
line profile, a blue asymmetry (redshifted self-absorption) may be an indication of infalling gas (see Myers et 
al. 2000); however, the blue asymmetric profile can also be created by rotating and outflowing gas (Redman 
et al. 2004). For a large sample of sources, it is possible to statistically identify infall in the population by 
searching for an excess of blue asymmetric profiles. Infall does not create a red asymmetric profile in centrally 
heated, optically thick gas while rotation and outflow can equally produce both blue and red asymmetric 
profiles. Surveys of high-mass star-forming regions in HCN J = 3 — 2 have shown statistical excesses in 
blue asymmetric profiles (Wu & Evans 2004). In order to calculate the line asymmetry of our subset of 
sources with self-absorbed profiles, we must obtain observations of an optically thin isotopologue (II^'^CO+) 
to discriminate between self-absorption and a cloud with two closely-spaced velocity components along the 
line of sight. We shall observe this subset of sources in H^'^CO+ J = 3 — 2 in a future study. 



3.2. Integrated Intensity and Peak Line Temperature Analysis 

3.2.1. Comparison of Molecular Emission 



Figures 10 (a) and (b) show the difference in the distributions of line temperature and integrated intensity 
for HCO"'" and N2H+ J = 3 — 2. The IICO+ emission extends to far greater intensities than N2II+, whose 
distribution seems to be truncated at high intensities. We find, on average, in our Full Sample, HCO+ 



J = 3 — 2 to be 2.18 times as bright as N2H+ J = 3 — 2 in integrated intensity I(T,„f,) (Figure 11 (a)). There 
are a small number of sources (11.8%, N = 114) detected in both HCO+ and N2H+ that show stronger 
N2H+ J = 3 — 2 emission than HCO+ J 3 — 2 emission. For these sources 1/3 are self absorbed in IICO+. 
The integrated intensity of each species is highly correlated with a Spearman's rank correlation coefficient 
oi p = 0.82. The slope of a linear regression (MPFIT) is m = 0.82, taking into account errors in x and y 
directions. The highest intensity points appear to form a tail turning upward on the plot of I(T„ib HCO^) 
versus I(T„jb N2II+). This would be expected for the warmest clumps since the N2H"'" abundance is expected 
to decrease in warmer regions, which should be more prevalent toward brighter 1.1 mm sources (§3.2.2). 



The peak line temperature tells a similar story to the integrated intensities. The upward curving tail of 
points at the brightest end of the Full Sample is less noticeable for peak line temperature (Figure 11 (b)). 
The average ratio of peak line temperatures is T„jf,(HCO+ J = 3 — 2)/T,„(,(N2lI+ J = 3 — 2) = 1.94 and 
the correlation coefficient is p — 0.79. The best fit line has a slope of m = 0.83. 



It is interesting that HCO"^ and N2II+ J 3 — 2, with their similar effective densities but different 
chemistries, are so highly correlated. Their similar effective densities should result in their emission being 
co-spatial; however, their chemical differences should result in differentiation (e.g., J0rgensen et al. 2004; 
Pirogov et al. 2007). It is likely that any differentiation is unresolved within our beam, which averages 
over the densities, temperatures, and abundance structure on size scales of a few tenths of a parsec (e.g., 
Battersby et al. 2010; Reiter et al. 2011). 
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3.2.2. Comparison with Millimeter Continuum Emission 



In Figures 12 (a) and (b), we show the integrated intensity of HCO+ J = 3 — 2 and N2H+ J = 3 — 2 versus 
the 1.1 mm flux per beam obtained from the BOLOCAT vO.7 positions on the vl.O BGPS maps (Rosolowsky 
et al. 2010). The Spearman's rank coefficient for the two species are Phco+ = 0-80 and PN2H+ = 0.73. The 
slopes are mfjQo+ — 1-15 and rnj^^H+ = 1-28. For HCO"'", the high 1.1 mm flux points have a tail that 
continues curving up toward higher HCO"*" emission with increasing 1.1 mm flux. In contrast, the N2H+ 
J = 3 — 2 emission for high 1.1 mm fluxes shows a flattening which is again consistent with N2H+ being less 
abundant in warm sources. The median ratio of integrated intensity to 1.1 mm emission is 6.32 K km s^^ 
per Jy/beam and 3.27 K km per Jy/beam for HCO+ J = 3 — 2 and N2H+ J = 3 — 2, respectively. 



Comparing the peak line temperatures of the molecular emission versus the 1.1 mm dust flux (Figures 



12 (c) and (d)) leads to similar results as the integrated intensities. The correlations are still significant: 



PHCO+ — 0-'''5 and PN2H+ = 0.73. The slopes are lower than for integrated intensity: to//co+ = 0.88 and 
mjs!^H+ = 0.98. The median ratio of peak line temperature to 1.1 mm emission is 1.76 K per Jy/beam and 
1.06 K per Jy/beam for HCO+ J = 3 - 2 and N2H+ J = 3 - 2, respectively. 



We also compare the ratios of the integrated intensities and the peak line temperatures of IICO+ and N2II+ 
with 1.1 mm flux in Figure [13} Both ratios are uncorrelated with the 1.1 mm dust emission. Surprisingly, 
there is a wide range in the observed intensity and peak temperature ratios, even for bright sources with 
fluxes above 1 Jy. Even in these brightest 1.1 mm sources, the N2H+ J = 3 — 2 emission can be strong, 
indicating significant amounts of unresolved dense, cold (T < 20 K) gas within the beam. 



3.3. Velocities and Linewidths 



We use the Gaussian fit velocity centers of the HCO+ and N2II+ lines as described in \ 2.5.2 to determine 



^LSR.- We plot vlsr versus Galactic longitude in Figure 14 (a). We find the distribution of vlsr hi the 
dense gas tracers is comparable to that of CO 1-0 (Dame et al. 2001). The spread in dense gas velocities is 
very similar to the spread in CO emission at each i when our data is overplotted on the Dame et al. (2001) 
V — £ map. The vlsr determined for sources detected in both HCO+ and N2H+ agree very well, see Figure 
(6). We use only the Gaussian fit HCO+ velocities in §4 to calculate the kinematic distances of sources. 



14 



Figure 



14 



(c) shows the FWHM of our detected IICO+ lines versus Galactic longitude. There is no trend 
with £ apparent in the sources we have observed. The few bins where the linewidth seems to vary by an 
appreciable amount have small numbers of sources in them. There is a moderate relationship in the plot 
of A-ypCO"*"] versus I{Tmb)[ HCO+], see Figure 15 (a). This is expected, as the integrated area of a line 



is directly related to the peak temperature multiplied by the FWHM. Given the relationship between Av[ 
HCO+] and I{Tmb)[ HCO+] and Si.imm and IiTMB)[ HCO+], it is logical to expect a trend of Av with 
Si.imm', Figure 15 (b) shows this trend. A moderate correlation also exists between the Hnewidth and the 



1.1 mm dust emission per beam at the BOLOCAT vO.7 positions; however, there is large amount of scatter 
around this trend. 



- 12 - 



4. Discussion 



4.1. Kinematic Distances 



We use the kinematic model of the Galaxy as defined by the parameters determined by Reid et al. (2009) 
to calculate the near and far distances to BGPS clumps. One thing to note is that the distance determination 
for Reid et al. (2009) assumes all motions are in the azimuthal direction only and does not account for any 
radial streaming which is known to exist near ^ ~ 0. This model sets the distance from the Galactic center 
to the Sun to be Rq = 8.4 ± 0.6 kpc and the circular rotation speed 0o = 254 ± 16 km s~^ from VLBI 
parallax measurements. We then use these parameters and the kinematic definition of vlsr to compute the 
distances to all of our sources with single HCO+ velocity components. 



The distances for all detected sources are plotted versus Galactic longitude in Figure [16] In the first 
quadrant (0° < £ < 90°), a velocity will give two distances that are degenerate. Without further information, 
we cannot tell if a source is on the near side or the far side of the galaxy. For sources that are known to 
be within a given region, and thus approximately the same distance, we can quantify the velocity spread 
of the individual sources. For instance, the spread in vlsr for sources in 109° < £ < 112° is 4.9 km s~^. 
This is one measure of the systematic "random" errors in our vlsr due to intrinsic motion that limits the 
accuracy of the corresponding distances. Some sources have much larger peculiar motions determined from 
VBLI parallax, as great as 40 km s~^ (Nagayama et al. 2011), but it is not likely the majority of sources will 
be severely discrepant. Some distribution of distances is expected and the spread in velocities for sources 
nearby makes an accurate kinematic distance determination difficult. In our distance determination, a cloud 
with a velocity at or greater than the tangent velocity will be placed at the tangent distance. In § 4.2, we 
resolve the distance ambiguity for a subsample of our sources. 



4.1.1. Galactocentric Distance 



The Galactocentric distance is the distance of the source from the Galactic Center and is only dependent 
on velocity and Galactic longitude of a source and therefore does not have a distance ambiguity. We plot 
a variety of source properties versus their Galactocentric distance in Figure |17| The distribution of sources 
clearly traces four major spiral arm structures in the Galaxy. The large peak at 4.5 kpc corresponds to 
the molecular ring and, for sources near £ — 30° , the edge of the central bar. The largest concentration 
of sources is within these two structures. The other structures in order of galactocentric distance are the 
Sagittarius arm, the local arm, and the Perseus arm. 



We also plot the observed quantities (linewidth, integrated intensity, and 1.1 mm flux) versus Galacto- 
centric distance. There is a large amount of scatter in each 1.5 kpc bin, and the median values of each 
quantity are nearly constant except for the bin beyond 10 kpc. The median linewidth, integrated intensity 
(both IICO+ and N2H"'"), and 1.1 mm flux are systematically lower for sources beyond 10 kpc compared 
to smaller Galactocentric distances. This could be due to a bias in the original BGPS observing strategy. 
Sources at Galactocentric distances greater than 10 kpc are predominately in the second quadrant of the 
Galaxy. Only a few selected regions (e.g.. Gem OBI, G111/NGC7538, IC 1396) were observed by the BGPS 
in this quadrant. Unlike BGPS observations toward the first quadrant, these second-quadrant heterodyne 
follow-up observations are not a complete census of sources in the second quadrant and biased toward known 
star forming regions. It is possible that the observed regions are not entirely representative of the properties 
of Galactic sources at this distance biasing our results to sources with stronger IICO+ and N2II+ lines. This 
would make the "true" Galactocentric gradient larger than what we see in Figure 17 Another possibility is 
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that nitrogen and/or carbon metallicity gradients in the Galaxy are becoming important, and that is why 
HCO"'" and N2H+ are becoming weaker, on average, beyond 10 kpc. There is a decreasing trend for both N 
and C in OB stars when going from the inner galaxy to the outer galaxy (Daflon & Cunha 2004) . This same 
effect may manifest itself in the dense gas as well; however, more complete sampling is needed to understand 
these effects. 



4.2. Resolving the Distance Ambiguity 

We must determine whether or not a source lies on the near or far side of the tangent distance in order to 
resolve the distance ambiguity (see Figure 16). We use three conservative methods to break the degeneracy: 
coincidence with sources with observed maser parallax measurements, coincidence with Infrared Dark Clouds 
(IRDCs),and correspondence with known kinematic structures in the Galaxy. We then use this subsample 
of sources where the distance ambiguity has been resolved, the "Known Distance Sample," to study the 
properties of BGPS objects. A detailed study of the probability that BGPS sources in the first quadrant lie 
at the near or far distance is currently being made by Ellsworth-Bowers et al. (in preparation). 



The most accurate distance determination technique is direct parallax measurements of sources by very 
long baseline interferometry (e.g., Reid et al. 2009a, 2009b). For a source to be considered associated with 
a VLBA-determined parallax, we allow for a source position to be different from the VLBA position by up 
to 30" (one beam size). We have 4 sources that are coincident on the sky with a VLBA source but only 3 
with a single HCO"'" detection. The distances used to these objects are their parallax distances. 



The next selection criterion we used to determine distance is coincidence with an IRDC. IRDCs are clouds 
of dust that appear dark against the background of mid-infrared Galactic radiation (for example at 8 /Ltm) . 
Because these objects appear in front of the majority of Galactic emission, they are assumed to be on the near 
side of the galaxy. Placing an IRDC is at the near kinematic distance is a good assumption but disregards 
the fact that a small number of IRDCs could be at the far distance. IRDCs have been catalogued in the 
Galactic plane from MSX mid-infrared observations (Egan et al. 1999; Simon et al. 2006) and most recently 
from Spitzer Space Telescope observations (Peretto & Fuller 2009). Peretto & Fuller (2009) developed a 
catalog of IRDCs and describe each cloud with an ellipse. We first select sources that lie within the ellipse 
itself. In reality, IRDCs have a wide range of projected geometries and a simple ellipse is not always the 
best choice to describe more complicated filamentary shapes. Therefore, we also did a by-eye comparison of 
Spitzer GLIMPSE (Benjamin et al. 2003) images and BGPS images and made a list of BGPS sources that 
appeared to be coincident with an IRDC. Both the ellipse-coincidence method and the by-eye method suffer 
from different biases (e.g., the ellipse shape is too simple and the by-eye method is subjective and depends 
on image display parameters). We conservatively choose sources that are coincident with an IRDC from 
both methods to be assumed to be at the near distance {N = 192). 

We have also included sources in the Known Distance Sample that do not have distance ambiguities, 
including sources in the Outer Galaxy {£ > 90°, N = 89) and sources in the first quadrant that lie at the 
tangent distance. A caveat is that the method used to compute the distances forces a source to be at the 
tangent distance if its velocity is larger than allowed in the circular rotation model of the Galaxy. We also 
include sources that are coincident with the sources in Shirley et al. (2003) and the H II regions in Kolpak 
et al. (2003). In addition to all of this we use kinematic information from Dame et al. (2001) to include 
sources near £ ^ 80° that have the corresponding velocities of the Cygnus-X region. The other sources in 
this i range have negative velocities, indicating they lie in the outer arms. We do the same analysis and 
add sources near 20° < £ < 55° that have velocities corresponding to the Outer Arm. We also add a few 
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sources with ^ ^ 10 that correspond to the 3 kpc Arm. All of the 529 sources for which the distance has 
been determined comprise our Known Distance Sample. For the remainder of the paper, we will only use 
the Known Distance Sample for our analysis of source properties unless otherwise specified. Resolution of 
the distance ambiguity for all observed sources is beyond the scope of this paper. In a future paper, we will 
build probability density functions for the distances to BGPS sources by combining dense gas tracers such as 
HCO"*" and N2H+ with extant data sets such as the Galactic Ring Survey (-"^^CO ( J = 1 — 0) for diffuse-gas 
velocity comparison, H I Galactic Plane Surveys (VGPS, CGPS, SGPS) for H I Self-Absoption, models of 
Galactic molecular gas distribution, and a more refined analysis with IRDCs (Ellsworth-Bowers et al., in 
preparation) . 



Figure [18] is a histogram of the heliocentric distances for the Known Distance Sample. These sources 
include outer Galaxy sources, so the peaks of the distribution do not always correspond to a specific spiral 
arm. The observed peak of sources at 5 kpc from us does correspond with the Near 3 kpc arm and the edge 
of the Galactic bar. The median distance of the Known Distance Sample is 2.65 kpc. The large number of 
sources at a distance of 1-2 kpc comes from sources that are in the range of ^ = 80° — 85° and those in the 
outer galaxy (W3/4/5, NGC 7538). In this area, the intrinsic dispersion of velocities is much larger than the 
allowable velocities for the kinematic model of the galaxy so we use known average distances to the regions 
in this i range. 



Compiling this distance information together gives us a look at Galactic structure. Figure 19 shows the 
face-on view of the Milky Way given the kinematic distances we have measured. The sources that lie in the 
Near Sample are separated from those where we have not resolved the distance ambiguity. The unresolved 
sources are plotted twice for their near and far distances. Even with the distance ambiguity affecting the 
majority of our sources one can begin to see strong evidence for Galactic structure including the Near 3 kpc 
arm, the end of the Galactic bar, as well as the Sagittarius and Perseus Arms. The "molecular ring" is also 
visible between 3 and 5 kpc from the center of the Galaxy. 



4.3. Size-Linewidth Relation 



Several physical properties of the clumps may be derived once the distance to the source is determined. 
The size of BGPS sources were determined from analysis of the flux distribution in 1.1 mm continuum images 
(Rosolowsky et al. 2010). The physical size of the object is simply r = 9 ■ Distance, where 9 is the angular 
radius of the sources. Figure [20] (a) shows histograms of the deconvolved angular source size as measured in 
the BOLOCAT vl.O. The median source size for the Known Distance Sample is ~ 60", or 0.752 pc. This is a 
factor of 2.35 larger than the CS J = 5 — 4 source sizes from Shirley et al. 2003 (0.32 pc median source size) 
but inhne with Wu et al. 2010 who find median sizes of 0.71 pc in HON ,7= 1 - 0.77 pc in CS J 2 - 1. 
This indicates that the typical BGPS clump is of lower density and more extended than the sample of water 
maser selected clumps traced in CS J = 5 — 4 in the Shirley et al. survey. CS J = 5 — 4 also has an effective 
density an order of magnitude higher than HCO+ J = 3 — 2 and is tracing the denser gas toward the center 
(Reiter et al. 2011). The Shirley et al. (2003) CS J = 5 — 4 survey was a very heterogeneous sample of 
sources that spanned a wide range of distances. In comparison to a more homogenous sample at a common 
distance the continuum survey of the nearby CMC Cygnus X, by Motte et al. (2007), find an average source 
size of 1.2 mm continuum clumps of 0.1 pc, with a HPBW of 11"; this is a factor of seven times smaller than 
the median BGPS clumps size. 



We plot the clump size against the distance to the source (Figure 20 (b)). There is a very strong relationship 
directly due to the fact that the physical size grows directly proportional to distance for a given angular size. 
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The line indicates the physical size that is equivalent to our beamsize at each distance. This shows that 
89% of sources are spatially resolved by the BGPS beam. We caution that we are not resolving everything 
within the source as there is clear evidence of unresolved cores within the larger clump structures. Rather, 
the large size indicates that BGPS objects are, on average, larger and more diffuse structures than have been 
systematically studied before in high-mass star- format ion surveys. 



The integrated kinematic motions along the line-of-sight determine the observed FWHM linewidth of 
the clumps. Linewidths may be broadened due to thermal motions, unresolved bulk flows of gas across a 
cloud, small scale (unresolved) turbulence, and optical depth effects. Thermal broadening for our sources is 
minimal since AvthermiFW H M) = 0.612 km s~^ at T = 20 K. The observed median linewidth of HCO+ 
for the Known Distance Sample is 2.98 km s~^, which indicates supersonic motions within the 30" beam. 
The typical linewidth is comparable with the observed linewidths toward high-mass star forming clumps 
(e.g., Shirley et al. 2003) and much larger than the typical linewidths observed toward nearby low-mass star 
forming cores (e.g., Rosolowsky et al. 2008). Comparing the distribution of linewidths versus distance shows 
no trend at all. As a source gets farther away, our beam is averaging over a larger source size; yet, we do 
not see any systematic increase in linewidth with distance. 



Optical depth effects may broaden the observed linewidth. This effect may be especially acute for the 
HCO"^ J = 3 — 2 line (e.g., > 11% of sources have evidence of possible self-absorption). Equation 
that as the optical depth increases, the linewidth increases: 



10 



shows 



Av 1 



^"(r^^^) (10) 



(Phillips et al. 1979). For r = 10, the optically thick linewidth to the optically thin linewidth ratio is larger 
by a factor of 2. For our observed linewidths, which are 10 times the thermal linewidth, even accounting for 
modest optical depth effects, it is unlikely that optical depth effects can account for the large observed median 
linewidth. Therefore, we conclude that the dense molecular gas in typical BGPS sources is characterized 
primarily by supersonic turbulence. 



The size and linewidth of the Known Distance Sample clumps are directly compared in Figure 21 (a). 
In contrast to the Larson relationship for molecular clouds (Larson 1981) we do not observe a strong size- 
linewidth relationship in the dense molecular gas for BGPS clumps: pspearman — 0.40. While there is a 
very weak trend that generally agrees with Larson's relationship, the scatter in the data erases our ability to 
say much about it. Traditionally, studies of the size-linewidth relationship in cores find two different slopes 
depending on the mass of the objects (Caselli & Myers 1995). For instance, the study of Caselli & Myers 
(1995) find a very shallow slope of R°'^^ for high-mass cores and a much steeper slope of R°-^'^ for low-mass 
cores. Combining these two distributions may partially erase the size-linewidth relationship (Shirley et al. 
2003); however, the derived power laws from Caselli & Myers (1995) predict significantly smaller linewidths 
than the sources in the Known Distance Sample. The lack of a correlation between size and linewidth argues 
against a universal scaling relationship between the amount of supersonic turbulence in dense clumps and 
their size. 
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4.4. Mass Calculations 

The clump mass may be calculated from the total 1.1 mm continuum flux for each source in the Known 
Distance Sample by 

J^UyJ- dust ) ' l^dust.l.l ' IQQ 

(Hildebrand 1983) where S'l.i is the total dust emission from a source, D is the distance to the source, 
K£j„st(l-l"im) = 1.14 cm^ g^^ is the dust opacity at 1.1 mm (Osscnkopf & Hcnning 1994), -Bi.imm is the 
blackbody intensity at 1.1 mm where we initially assume a temperature of T^ust = 20 K. We also assume 
a gas-to-dust mass ratio of 100:1 (Hildebrand 1983). Millimeter dust continuum observations are incredibly 
sensitive to small amounts of mass, as we can detect a 20 K source of 1.1 Mq at distances of 1 kpc given an 
average 3a flux threshold of 90 mJy at 1.1 mm. Below (§ 4.4.1), we will systematically explore the results 
of changing the dust temperature distribution of BGPS sources using a Monte Carlo simulation. 



Figure 22 (a) plots a histogram of the masses of the Known Distance Sample assuming Tdust = 20 K. The 
median mass of the sample is 320 Mq and the mean is 1648 Mq. The BOLOCAT is complete to 98% for 
sources > 0.4 Jy; this gives us a completeness limit for masses of 580 Mq at a distance of 8.5 kpc (most 



of our sources are at the near distance). The mass versus distance plot, Figure 22 (b) shows the expected 
trend of more massive sources at the farthest distances due to the term in the mass calculation. The 
masses we observed range from 10 Mq to 10^ Mq. Compared to other samples of high-mass clumps, we 
probe similar mass ranges as the H2O maser sample of Shirley et al. (2003); although, our observed median 
mass is smaller. The mass distribution is similar to that observed for the BGPS Galactic Center sample for 
sources assumed to be at the distance of the Galactic center, 8.5 kpc (Bally et al. 2010). In comparison 
with clumps in targeted regions in Orion (Johnstone & Bally 2006) and Cygnus-X (Motte et al. 2007) probe 
typical masses of 10s to 100s of Mq, more analogous to core masses. In comparison to IRDCs, (Rathborne 
et al. 2005, Peretto & Fuller 2010), we appear to probe the same physical properties on the same scales. 
IRDC fragments tend to agree with masses of clumps in targeted regions, while the overall IRDC properties 
agree with those found in this study. For example, in Peretto & Fuller (2010) they find mass ranges of a 
few tenths of solar masses to nearly 10^ Mq and are complete to ~ 800 Mq. We are sensitive throughout 
the entire mass range of these other samples, leading us to believe we are observing objects that span from 
dense cores to clouds (see Dunham et al. 2010). 

Using the dust-dctcrmined mass and the source size, we compute the volume-averaged number density 



given a mean free particle weight, fi = 2.37 (Figure 23 (a)). Assuming a spherical volume for each source. 
The median value is rt = 2.5 x 10'^ cm^'^, which is within a factor of 3 of the masses determined in the NII3 
survey of Gem OBI, where they find a median volume-averaged density of n = 6.2 x 10^ cm~'^ (Dunham 
et al. 2010a). Compared to the IRDC sample of Peretto & Fuller (2010), we probe the same range of 
volume-averaged number densities from 100 to 10'* cm~^ . This low volume-averaged number density is a 
result of the large observed source sizes and, therefore, large volume observed toward BGPS clumps. The 
volume-averaged density is lower than the typical eff'ective excitation density for HCO+ and N2II+ J = 3 — 2 
emission. Steep density gradients are known to exist in both low-mass (Shirley et al. 2002) and high-mass 
(Mueller et al. 2002) star-forming clumps. If we were resolving individual dense knots, we would expect to 



see a trend of linewidth with volume- averaged density as well (Figure 23 (c)). Therefore, BGPS sources tend 
to be fairly low-density sources with possible compact high-density regions that may be probed with higher 
angular resolution observations. Some of these clumps of low density gas contain high density regions up to 
n = 10^ cm~'^, several orders of magnitude higher than the average source properties (Dunham et al. 2011). 
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Comparing the time scales of these clumps shows that the free-fall and crossing time scales are similar at 
a few times 10^ years. The median free-fall time is 8 x 10^ yr; thus, if these clouds were bound, it would 
take a few hundred thousand years to collapse and begin forming stars. With the masses of these clumps we 
also compute the virial linewidth as 

A, ,2 _ ?>\n2aGM^M _ °' km (19) 

^"^^"'^^ " 5R ~ [R/l pc)(A^;„b,/l km s'^f ' ^^^^ 

.= ^;.^1.5 (13) 

This definition includes the correction factor for a power-law density distribution of p=1.5 (Bertoldi & McKee 
1992, Shirley et al. 2003). The virial linewidth represents the velocity dispersion from internal motions due 
to self-gravity. Figure [2l](b) shows the virial linewidths are smaller on average than the observed linewidths, 
about 2-3 km s~^ indicating that many of the BGPS clumps are not entirely gravitationally bound. It is 
likely that there are smaller denser regions within these clumps that are gravitationally bound. 

We can also look at the virial parameter in terms of the surface density (see §4.4.1) of the clumps given 

by 

" 81n27rGEi?- ^ ' 



This is shown in Figure 21 (c). The virial parameter versus mass is also shown in Figure 21 (d). This is 
another way of showing that few of our clouds indicate virialized motion (a <~ 1) and most lie more than 
a factor of a few above this line. The caveat is we assume the BGPS traces ALL the mass which may not 
be entirely true. The BGPS does resolve out diffuse emission and the dense gas tracers are missing the 
low-density gas traced by CO or H I. It appears that most of the clumps we see are likely not gravitationally 
bound but do contain dense substructures that likely are. 



.1. Differential Mass Histogram 



Figure 22 (c) shows the differential mass histogram for the number of sources in bins of log(M). In this 



parameterization, the mass function takes on the form of 

dlo9{M) ' ^^^> 

where the power-law index is the slope of a line through the histogram. For the observed masses, calculated 
assuming a dust temperature of 20 K, we find a slope of a — 1 = 0.91. This slope is shallower than the slope 
of the Salpeter stellar IMF a — 1 = 1.35 for dN/dlog(M). Our observed slope is steeper than has been found 
for the mass distribution of CO clumps a — 1 = 0.6 — 0.7 (e.g., Scoville & Sanders 1987). Observations of 
dense-gas tracers tend to increase the differential mass histogram slope. For instance, Shirley et al. (2003) 
find a slope of a — 1 = 0.9 for their cumulative mass function for cores probed by CS J = 5 — 4. In comparison 
to IRDCs (Peretto & Fuller 2010), they find a slope of a - 1 = .85 for the total IRDC. This intermediate 
slope also suggests that we really are looking at the intermediate case of star forming "clumps" rather than 
clouds or cores. 



There are several sources of uncertainty in determining the slope of the differential mass histogram which 
we must characterize. The effect of binning has been shown to cause problems in studies of the stellar mass 
function and thus will also be problematic with the clump mass spectrum (Mafz Apellaniz & Ubeda 2005). 
For instance, the choice of binsize may have a substantial effect on the computed slope of the differential 
mass histogram. Bins with small numbers of sources dominate the fit if the binwidth is chosen too small. 
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and the number of bins used in a linear regression decreases rapidly if the binwidth is too large. Our data do 
not span many decades in mass above the the estimated completeness limit in the Known Distance Sample, 
and thus choosing an appropriate binwidth is difficult. We can circumvent this binning problem by using 
the maximum-likelihood estimator (MLE) to compute the power-law slope a (Clauset et al. 2007, Swift & 
Beaumont 2010). The MLE maximizes the likelihood that the data was drawn from a given model. If the 



data are drawn from the distribution given by equation 16 then the maximum likelihood estimate of the 
power-law index is a given by equation |17[ where Mmin is the lower bound of the power law behavior and 
n is the number of sources with mass greater than Mmin (Clauset et al. 2007), 



p{x) 



1 



M„ 



M 
M ■ 



(16) 



a = l 



El- 



(17) 



Another important source of uncertainty is the assumed dust temperature of BGPS sources. We initially 
assume T^ust = 20 K for every source in the Known Distance Sample to compute our masses (Merello et al., in 
prep). In reality, BGPS sources have a range of dust temperatures of an unknown distribution. We perform 
a Monte Carlo simulation of the differential mass distribution to constrain the range a by assuming that the 
source temperature distribution is approximated by a Normal distribution, characterized by mean Tmean and 
(t(T). We then compute the mass of each source with a random temperature drawn from the temperature 
distribution and the source flux that has an error term drawn randomly from a Normal distribution added 
in. For each T„iean and (j{T), the mass histogram is computed 10** times, and we calculate the median value 
of a. 



The variation of a with the temperature distribution is shown in Figure 24 (a). The effect of temperature 
distribution is significant but not strong. As part of the MLE procedure we estimate the best value of M^m, 
the minimum mass used in the power-law fit for each mass distribution. This is determined by minimizing 
the KS statistic between the best-fit model and Mmin as a function of Mmin (Clausset et al. 2007; Swift & 
Beaumont 2010). The dependence of Almtn on temperature means that colder temperatures lead to higher 
masses for a given observed flux. This dependence is also seen in the median of the mass distribution. Figure 



24 (b), and the median volume- averaged number density. Figure |24| (c). 



There are several important caveats that must be considered in this analysis. Foremost, the BGPS 
pipeline reductions will leave artifacts in the data. The process by which the sky variation is removed from 
the data will also remove some of the diffuse emission, resulting in spatial filtering of the 1.1 mm brightness 
distributions (Aguirre et al. 2011). Furthermore, the cataloguing algorithm is also more sensitive to peaked 
emission than low levels of diffuse emission (Rosolowsky et al. 2010). This means we are not seeing diffuse 
sources unless they are very bright. We are missing flux from extended emission surrounding the clumps we 
do see, thus leaving us with a lower limit on the total mass, and it also affects the derived sizes. This survey 
is flux-limited, which means we also suffer from Malmquist bias. The BGPS is sensitive to different types 
of sources (i.e. cores and clumps vs. clouds) as the distance increases (see Dunham et al. 2010). The last 
caveat is that the brightest BGPS clumps (most massive) are associated with H II regions which heat the 
dust and may contain a significant amount of free-free emission which would most likely lead to a steepening 
of the mass spectrum. However, we present one of the first differential mass distribution with statistically 
significant numbers showing that massive clumps appear to have a shallower slope than nearby low-mass 
cores. 
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Mass Surface Density 

While the derived quantities discussed above depend expUcitly on distance, one quantity that is distance 
independent is mass surface density. We compute the mass surface density while assuming T^ust — 20K 
from 

_ 37.2 ■ Sjy 2 /io\ 

- - -^2 g cm , (18) 

^ ^arcsec 

where S only depends on the the observed flux, observed source size, and the assumed dust temperature. 



Figure 25 (a) shows the mass surface density for the all of the BGPS sources we observed with determined 
fluxes and sizes from the vl.O catalog {N = 1684). Since dust temperature variations affect the derived 
mass ([ 4.4.1), we also perform a Monte Carlo simulation to calculate the median mass surface density for 
distributions with different Tmean ^^nd T^. The Monte Carlo simulation includes variations of the fluxes by 
the Normal distribution corresponding to the flux and error on the flux. Figure [25] (b) shows how the median 
of the surface density distribution changes with temperature. The median of the Tdust = 20 K distribution 
is 0.027 g cm^^ for the Full Sample and 0.033 g cm^^ for the Known Distance Sample, indicating that 
the properties derived from the Known Distance Sample are likely representative of the total sample. Both 
values are signiflcantly smaller than the results from Shirley et al. (2003) , where they find a median surface 
density of 0.605 g cm^^. Our median surface density is also much smaller than the 1.0 g cm^'^ threshold 
that Krumholz and McKee (2008) require for massive star formation (see Fall et al. 2010). 



The dependence of the median of surface density distribution on temperature is shown in Figure [24] (d) . 
The median mass surface density only increases by up to a factor of 2.5 for distributions with colder dust 
temperatures but they are still significantly below that of Shirley et al. (2003). The smaller mass surface 
density is likely a result of the larger clump sizes measured for BGPS clumps compared to previous surveys. 
However, our result does not indicate that these BGPS clumps are not capable of forming massive stars. 
It is likely that observations with higher spatial resolution will reveal the observed mass surface density 
approaching 1.0 g cm~^ for cores within BGPS clumps. Indeed, the infrared populations of BGPS clumps 
have recently been characterized by Dunham et al. (2011) who found that many BGPS clumps are in fact 
forming high-mass stars. They find that 49% of sources that are in the regions where the BGPS overlaps 
with other mid-IR Galactic plane surveys contain at least one mid-IR source. 



5. Conclusions 



We used the 10m Heinrich Hertz Telescope to perform spectroscopic follow-up observations of 1882 sources 
in the Bolocam Galactic Plane Survey. We simultaneously observed HCO+ J = 3 — 2 and N2H+ J — 
3 — 2 emission using the dual-polarization, sideband-separating ALMA prototype receiver. Out of the 1882 
observed sources, we detect ^ 77% of the sources in HCO"*" and over 50% in N2H+. Multiple velocity 
components along the line-of-sight to BGPS clumps in these dense molecular gas tracers are rare. Our 
detection statistics are somewhat biased toward more detections because this sample includes all of the 
intrinsically brightest sources. 



We find a strong correlation between peak temperature and integrated intensity of each dense gas tracer 
with each other and with the 1.1 mm dust flux. The median ratio of HCO+ integrated intensity to 1.1 mm 
flux is 5.42 K km per Jy/beam . We find that HCO+ is brighter than N2H+ for the vast majority of 
sources, with a subset of only 117 sources (12.6% of sources singly detected in N2H"'" ) where N2H"'" is the 
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brighter of the two dense gas traccirs. The ratio of the peak line temperature and integrated intensity of the 
two molecules does not correlate well with 1.1 mm dust emission. 

The observed vlsr appear to follow the same distribution with Galactic longitude as ^^CO J = 1 — 
emission observed by Dame et al. (2001). We determine the linewidths from a best-fit Gaussian model and 
find little change, on average, with Galactic longitude. The median linewidth is 2.98 km s~^, indicating that 
BGPS clumps are dominated by supersonic turbulence. Linewidths only modestly correlate with 1.1 mm 
flux. 

We compute kinematic distances for all detected sources and are able to break the distance ambiguity for 
529 of our detections in the first and second quadrants using IRDC coincidence, VLBA-determined parallax 

source coincidence, or proximity to the tangent velocity or known kinematic structures. Using the set of 
sources of known distance, we compute the radius, mass, and average density of the sources. We find the 
median source size to be 0.752 pc at a median distance of 2.65 kpc, typically larger than source sizes observed 
in published surveys of high-mass star-forming clumps and cores. Comparing linewidth to the physical size 
of the source, we do not find any compelling evidence for a size-lincwidth relation in our data. Wc find our 
sources lie above the relationships found by Caselli and Myers (1995) and have too small of a correlation to 
say much about Larson's relationship (Larson 1981). 

For an assumed dust temperature of 20K we find a median mass of ~ 300 Mq, a low median volume- 
averaged number density of 2.4 x 10^ cm~'^, and a median mass surface density of 0.03 g cm~^. The similarity 
of the median mass surface density between the full sample and the Known Distance Sample indicates that 
the sources in the Known Distance Sample are characteristic of the Full Sample. Compared to published 
surveys of high mass star formation, BGPS clumps tend to be larger and less dense on average. We also 
analyzed the variation in median mass and volume density using a Monte Carlo simulation of cores with 
various dust temperature distributions. From the differential clump mass histogram, we find a power-law 
slope (dN/dlogM) that is intermediate between the slope derived for diffuse CO clumps and the stellar IMF. 
Finally, a comparison of the virial linewidth to the observed linewidth indicates that many of the BGPS 
clumps in this survey have motions consistent with not being gravitationally bound. 

In the future, we plan to complete observations of the BGPS catalog for all sources with £ > 10° in the 
dense molecular gas tracers HCO+ and N2H+ with the HHT. This will be the largest systematic survey 
of dense molecular gas in the Milky Way. A series of follow-up observations are currently underway to 
characterize the physical properties (density, temperature) and evolutionary state of BGPS clumps. 

We would like to thank the operators (John Downey, Patrick Fimbres, Sean Keel, and Bob Moulton) and 
the stafi^ of the Heinrich Hertz Telescope for their excellent assistance through numerous observing sessions. 
Yancy L. Shirley is funded by NSF Grant AST-1008577. 



Facilities: HHT (ALMA Band 6) 
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A. Appendix A 

In this appendix, we derive the relationship between ar and aj for a Gaussian line profile. 
Assume that the spectral line is well described by a Gaussian function: 

T{v) = Tpk ■ e(-^ ln(2)-vy Avl„^^) ^^^^ 

Where Tpj. is the peak line temperature and Avfwhm is the FWHM linewidth. The integrated intensity 
of this line is thus defined by the integral of temperature with velocity over the line window. 



/ 



T{v) dv = Tpk ■ Avfwhm 



4 ln{2) 



(A2) 



We may then ask the question, if wc have a non-detection in integrated intensity, then how many ctt (baseline 
rms) correspond to 3cr/? Lets rewrite Tpk for the undetected Gaussian line as: 



Tpk = N- ar 

The standard formula for the uncertainty of the integrated intensity as given by 



(A3) 



(A4) 



where Avch is the velocity resolution of each channel in the spectrum, and Avint is the velocity interval 
integrated over which encompasses the entire line, the line window. We choose a line window that integrates 
over 99% of the area in the line which is 2 times the 3a velocity width of the Gaussian line. The ^ velocity 
width is related to the FWHM by: 

1 AvpwHM 

v{-) = 



e' ^4 ln{2) 

Thus 2 • 3(T„ (the full width of the line, in both directions) is: 

Avint = 6 • Avfwhm ■ \/4 ln{2) 



(A5) 



(A6) 



Therefore, the ratio of I to u/ is 



I 



N ■ ar ■ Avfwhm 



4 ln{2) 



\ 



ar ■ \/Avch ■ 

Setting the ratio ^ equal to 3 and solving for N gives 
which is approximately 



\/4 ln{2) J 



(A7) 



Avch 



Av 



FWHM 



N 



Avch 



(A8) 



(A9) 



Avfwhm 

For example, if we have a barely resolved line with Avch = 1 and Avfwhm = 1, a 4.880- detection in 
temperature of a Gaussian line would be a 3cr detection in integrated intensity. 
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Fig. 1.—: Calibrations of Jupiter, S140, and W75(OH) versus MJD for the USB and LSB. For almost all 
observing sessions, rjmb and I(T^) are well behaved and consistent from day to day. At MJD — 245400 = 
920 - 922, there were 3 days where rj^b in ^poi was significantly lower than the rest of the observations; this 
is also seen in the two line calibration sources in both sidebands. These days were taken to be independent 
and calibrated with the average values of ri,nb for those 3 days as shown in Table [ij 
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Fig. 2. — : The two upper panels, (a) and (b), plot rj^ni, determined from the USB and LSB for each 
polarization. The two lower panels, (c) and (d), compare the two polarization feeds against each sideband. 
The Spearman's rank coefBcients are printed in the upper left corner of each plot. 
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Fig. 3. — : 40" aperture fluxes for all of the BOLOCAT VI. 0, solid blue line, and 40" aperture fluxes for the 
BOLOCAT vO.7 positions that were observed with the HHT as determined from the vl.O maps, dotted red 
line. 
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Fig. 4. — : Galactic Longitude versus Galactic Latitude plot and histogram for the "Full Sample" (a) and the 
"Deep Sample" (b) of sources. The spikes in the histogram around £ =+10, +20 and +80 degrees, are due 
to the deep survey that was done after the initial survey was completed. The peaks at £ = +25, +30, and 
+50 degrees, are due to the intrinsic number of sources in those areas. For £ >+90 degrees, these sources 
represent star formation complexes in the outer galaxy regions observed by the BGPS. 
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Fig. 5. — : Sample HCO"*" spectra of the 4 main flag types, (a) flag = 1 - single detection, (b) flag — 2 
multiple detections, (c) flag = 4 - Line wing(s). (d) flag = 5 - Self-absorbed spectrum. 
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Fig. 6. — : Histograms of the baseline rms. (a) and cr/ (b)for all sources. Noise levels are lower than 100 mK 
rms in the baseline for the vast majority of sources (94% for HCO"'" and 89% for N2H+). The median 
baseline rms for HCO"'" and N2H"'" are 53.8 mK and 58.6 mK respectively. In cr/, the median errors are 0.124 
K km s-i and 0.138 K km s^^ for HCO+ and N2H+ respectively. 
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Fig. 7. — : A histogram of HCO+ FWHM determined from the Gaussian fit to the spectrum. The Gaussian 
fits were computed using the MPFIT package also described in {2.5.2 The median of the distribution of 
FWHM is 2.9 km s^^. As a comparision, the median of the FWHM from the second moment calculation is 
2.3 km s~^. The Gaussian linewidths for N2H+ are broadened due to hyperfine splitting and are not shown 
here. Fitting the hyperfine structure of N2H+ leads to a median FWHM of 3.5 km s~^ and are shown as the 
red curve. 
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HCO+ Detection Statistics (Deep Sample) 
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Fig. 8. — : Detection statistics for HCO+ and N2H+ for all 1882 sources. We detect 76.7% of our sources in 
HCO+ and 50.5% in N2H+ at the 3cr level. Out of our "Deep Sample" N = 707, we detect 73.3% of sources 
in HCO+ and 41.2% of sources in N2H+. 
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Fig. 9. — : To see how detection statistics change with 1.1 mm flux we plot the percentage of detected sources 
versus the percentile of its flux (e.g. the 25% of sources, by number, with the lowest flux, etc.). The highest 
percentile has a 99% detection rate in HCO+ and a 88% detection rate of N2H+. 




Fig. 10. — : (a) A histogram of integrated intensity of HCO+ and N2H+ for the "Full Sample" of sources. 
These histograms are not scaled with respect to each other. They are logarithmically binned. Note the lack 
of sources with bright N2H+ J 3 — 2 emission and that HCO"*" sources are on average brighter than the 
N2H+. The median 'i(Ji(HCO+) — 0-37 K km s^^ and median 3cr/(jv,ff+) = 0.41 K km s^^. (b)A histogram 
of peak line temperatures of HCO+ and N2H+ for the "Full Sample" of sources. These histograms are not 
scaled with respect to each other. They are logarithmically binned. While it is apparent from the T versus 
T plots that HCO"'" has a typically higher line temperature it is clear from these distributions there are two 
different cutoffs for the peak line temperature of HCO''" and N2H+. The average 3crTpfc(ffco+) = 0.16 K and 
the average 'i(^Tj,^(N2H+) = 0.17 K. 
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Fig. 11. — : A comparison of the integrated intensities and line temperatures of HCO+ and N2H"'' for the 
"Full Sample" of sources. The light blue circles represent the 952 sources that have both an HCO"'" and 
N2H+ detection and the blue contours overlaid represent the density of points on the plot. The light red 
circles are HCO"'" detections and 3a N2H"'" upper limits. The red contours represent these points. The green 
circles are detections in N2H+ that do not have a corresponding 3ct HCO"'" detection and are upper limits 
for HCO"'". The solid line plotted is the 1-1 line and the dotted line represents the best fit to the data, (a) 
A comparison of the integrated intensities (K km s^^) of HCO+ and N2H"'". The slope of the best fit line 
is m = 0.82. (b) A comparison of the Peak line temperatures in Kelvin of HCO+ and N2H"'". The average 
source has a higher line temperature (Tmb) in HCO+ than N2H'*" by a factor of ~ 2. The slope of the bestflt 
line is m = 0.83. 
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Fig. 12. — : The two top figures, (a) and (b), show the integrated intensity of the molecular line versus the 
1.1 mm flux/beam from the BOLOCAT vO.7 positions on the vl.O maps. The light blue circles are the 
sources detected in HCO"'"(a), the hght red circles are 3a upper hmits to the HCO+ hne strength. The blue 
and red contours trace the density of points on the plot. The second panel (b) is the same plot but N2H+ 
integrated intensity is used instead. There is a strong correlation between the molecular emission and the 
dust emission phco+ = 0.80 and PN2H+ = 0.73. The dotted line is the best fit to the blue points. We find 
the slopes of the two distributions to be mHco+ = 1-15 and mN^H+ = 1-28. The two lower figures, (c) and 
(d), show the peak line temperature of the molecular line versus the 1.1 mm flux/beam from the Bolocat 
vO.7 positions on the vl.O maps. The symbols are the same as the previous two plots. There is a strong 
correlation between the line temperature of the molecular emission and the dust emission Phco+ = 0.75 and 
PN2H+ = 0.73. We find the slopes of the two distributions to be mHco+ = 0.88 and mjq^H+ — 0.98. 
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Fig. 13. — : Ratio of molecular emission versus 1.1 mm flux/beam for all sources detected in both HCO+ 
and N2H+. The ratio of integrated intensities (a) and peak line temperatures (b) versus 1.1 mm emission 
are presented in these plots. The points in both diagrams are not well correlated with p = —0.1 for (a) and 
p — —0.2 for panel (b). The solid lines are where the ratio equals 1.0 and the dotted lines represent the 
median of the ratios for each panel, (a) 1.81, (b) 1.69. 
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Fig. 14. — : (a) The HCO^ ^lsb. determined from the Gaussian fit for each detected source versus Galactic 
longitude. The envelope formed by the largest velocity at each £ represents the tangent velocity. (6) The 
comparison of vlsr of HCO+ and N2H+. The scatter in the data is 0.71 km s^^ which is smaller than the 
width of a channel, (c) The HCO"'" linewidth determined from the Gaussian fit versus the Galactic longitude 
of each source. We also plot the median FWHM within bins in £ to emphasize any overall trends. There 
does not appear to be any strong relationship with £. 
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Fig. 15. — : (a) The HCO+ linewidth determined from the Gaussian fit versus the integrated intensity of 
the HCO+ hue. There is a moderate correlation, with a Spearman's Rank Coefficient oi p ~ 0.60. (b) The 
HCO+ hnewidth determined from the Gaussian fit versus the 1.1 mm flux per beam from the BOLOCAT vO.7 
positions, fight blue circles. There is a lot of scatter but there is a moderate correlation between linewidth 
and 1.1 mm dust emission, p — 0.54. 
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Fig. 16. — : Kinematic distance versus Galactic longitude for all sources detected and do not have a flag of 
2. The near and far distances are shown in blue circles and red boxes respectively. This version of the plot 
accentuates the number of sources that lie near the tangent points at each £. 
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Fig. 17. — : (a) Histogram of Galactocentric distance of all sources with a single HCO+ detection, 
(b) Linewidth versus Galactocentric distance, (c) 1{Tmb, HCO'^) versus Galactocentric distance, (d) 
^{TmBtN2H'^) versus Galactocentric distance, (e) 1.1 mm dust emission versus Galactrocentric distance, 
(f) Mass Surface Density (S) versus Galactrocentric distance. We overplot the median values the source 
properties in 1.5 kpc bins to look at trends in the data, only bins with > 20 sources are plotted. In all 
cases the sources in the outer galaxy have smaller median values than those in the first quadrant. This is 
most apparent in panel (f) the Mass Surface Density plot. 
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Fig. 18. — : A histogram of Heliocentric Distance for the Known Distance Sample of sources with the distance 
ambiguity resolved. The median of the distribution is 2.65 kpc. The spike of sources that lies around 1 kpc 
are sources that are in the range of ^ = 80 — 85° and sources in the outer Galaxy. We use other distance 
measurements for sources in these regions. 
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Fig. 19. — : Face on view of Galactic structure. Using the kinematic distances determined along with the 
Galactic longitude we make a polar plot presenting the face on view of the Milky Way. Sources in the 
Known Distance Sample are plotted as green triangles. In the first quadrant of the galaxy, sources with 
an unresolved distance ambiguity are plotted in blue circles and red boxes for the near and far distance 
respectively. These points are plotted twice to represent the distance ambiguity for those not associated 
with an object that breaks the degeneracy. The large orange circle represents the Galactic Center and the 
Sun is located at the origin. 
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Fig. 20. — : (a) The histogram of the source sizes, in pc, as determined from the distance and the angular 
size in BOLOCAT vl.O for the sources in the Known Distance Sample. The median source size is 0.752 pc 
and is shown by the dotted line. (6) shows the radius of each source versus the distance from us. This trend 
is primarily from the fact the radius is a function of distance for a given source size. The line plotted is the 
source size that corresponds to our beamsize at each distance. 
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Fig. 21. — : (a) Size linewidth relationship of the 529 sources that have determined distances. Overplotted is 
the Larson relationship from Larson (1981), black line. While it runs through the middle of our data, there is 
not enough of a trend to determine anything about it. Also plotted are the two powerlaws from Caselli and 
Myers (1995), R'^^ (light red line) for high mass and R^'^ (light blue line) for low mass, (b) Virial linewidth 
versus observed linewidth shows that our clumps are not dominated by motions due to self gravity with the 
majority of sources with observed linewidths larger than the virial linewidth. (c & d) Observed linewidth 
(1/e- width) vs surface density / mass compared with the virial parameter a. The colored lines represent 
different values of the virial parameter. This plot also shows that few of the clumps have a virial parameter 
less than 1 and are fully gravitationally bound. 
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Fig. 22. — : (a) Histogram of observed masses as determined from the 1.1 mm dust emission for the Known 
Distance Sample. The median of this distribution is 320 Mq and the mean is 1272 Mq. Our completeness 
hmit is ~ 140 Mq shown by the black dotted line based on a source of 1 Jy (above which we observe every 
object in the BOLOCAT) at our median distance. (5) The distribution of masses versus the distance from us. 
The main trend in this plot is due to distance dependence of the conversion from flux to mass, M oc D^. The 
black dotted line represents the mass corresponding to our minimum flux versus distance. The red dotted 
line represents a 1 Jy source versus distance, (c) Differential mass histogram with dlogM — 0.3 (binsize) 
assuming = 20K. The black dotted line is our completness limit, see (a). The slope of the line is fit from 
the largest mass peak of the distribution to the highest masses. The slope determined for this "binned" 
differential mass histogram is to = —0.70 which is not the slope we determine from the MLE analysis of 
TO = —0.91. This difference is directly caused by the fact we have binned the data and will change based on 
the way one choses the binsize. 
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Fig. 23. — : (a) The volume-averaged number density for the "Known Distance Sample" . The median value 
is 2.4 X 10^ cm~^ (red dotted line). We are complete to ^ 1300 cm^^ (black dashed line), (b) Aw versus the 
average density, (c) The average volume density of a source versus distance. We are able to measure higher 
densities for closer sources because source size increases with distance given our finite beamsize. Only nearby 
sources can be fully resolved. The red line indicates our volume-averaged number density completness vs 
distance for a ~ 140 Mq object, Mass completeness limit, with size equal to the beamsize, minimum resolved 
source size, (d) The clump free-fall timescale and the clump crossing time scales. The median free-fall time 
is ^ 8 X 10^ years. The median crossing time is ^ 5 x 10^ years. 
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Fig. 24. — : (a) The variation of the power law index with choice of temperature distribution, (b) A Monte 
Carlo simulation of the median mass with variations in temperature. The units are in Mq. (c) A Monte 
Carlo simulation of the median volume-averaged number density with variations in temperature. The units 
are in cm""^. (d) A Monte Carlo simulation of the median surface density for variations in Tdust- The units 
are g cm~^. 
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Fig. 25. — : (a) The surface density for the entire sample assuming Tdust — 20 K. The median value is 
0.027 g cm~^. The completeness limit for the entire sample is 0.051 g cm^^ which is based on a source 
of 1 Jy coupled with the beamsize (black dashed line), (b) The surface density histogram for the Known 
Distance Sample. The median value is 0.033 g cm~^. The completeness limit for the Known Distance Sample 
is 0.016 g cm^^ which is based on mass completeness and the median size of a clump (black dashed line). 
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Table 1. rjmb for each polarization and sideband. The errors are the standard deviation of all main beam 

efficiencies for the dates specified. 





Vpoi LSB 


Ypoi USB 


Hpoi LSB 


Hpo( USB 




HCO+ J = 3 - 2 


N2H+ J = 3 - 2 


HCO+ J = 3 - 2 


N2H+ J = 3 - 2 


T}mb - Average 


0.81(0.04) 


0.81(0.03) 


0.70(0.03) 


0.70(0.04) 


Timb - MJD 918 - 920 


0.64(0.01) 


0.64(0.02) 


0.64(0.01) 


0.64(0.02) 



Table 2. Line Flags 



Flag Number Flag Description 



No Detection: There is no visible line 

1 Single Detection: One line visible with no visible structure 

2 Multiple Detections: More than one line is visible in the spectrum 

4 Obvious Line Wings; One line that shows a possible line wing(s) 

5 Self-absorbed Profile: One line that shows a possible self-absorbed profile 

9 Unusable Data: Both polarizations have defects that make the spectrum unusable 
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Table 4. Summary of log fit parameters 



log X-Axis 


log Y-Axis 


Intercept 


Slope 


Spearman's Rank (p) 


I(N2H+) K km s-i 


I(HCO+) K km s-i 


0.33 


0.823 


0.82 


T„6(N2H+) K 


T^6(HCO+) K 


0.21 


0.83 


0.79 


I(HCO+) K km s-i 


S,y(l.lmm) Jy/Bcam 


.78 


1.15 


0.80 


I(N2H+) K km s-i 


Si/(l.lmm) Jy/Beam 


0.55 


1.28 


0.73 


T„i,(HCO+) K 


Si/(l.lmm) Jy/Beam 


0.21 


0.89 


0.75 


T^i,(N2H+) K 


Si/(l.lmm) Jy/Beam 


-0.01 


0.99 


0.73 
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